THEOREMS FOR A PRICE: Tomorrow's Semi-Rigorous Mathematical Culture 



Doron Zeilberger 1 

Today 

The most fundamental precept of the mathematical faith is thou shalt prove everything rigorously. 
While the practitioners of mathematics differ in their view of what constitutes a rigorous proof, and 
there are fundamentalists who insist on even a more rigorous rigor than the one practiced by the 
mainstream, the belief in this principle could be taken as the defining property of mathematician. 

The Day After Tomorrow 

There are writings on the wall that, now that the silicon savior has arrived, a new testament is 
going to be written. Although there will always be a small group of "rigorous" old-style mathe- 
maticians^. g. [JQ]) who will insist that the true religion is theirs, and that the computer is a false 
Messiah, they may be viewed by future mainstream mathematicians as a fringe sect of harmless 
eccentrics, like mathematical physicists are viewed by regular physicists today. 

The computer has already started doing to mathematics what the telescope and microscope did to 
astronomy and biology. In the future, not all mathematicians will care about absolute certainty, 
since there will be so many exciting new facts to discover: mathematical pulsars and quasars 
that will make the Mandelbrot set seem like a mere Jovian moon. We will have (both human and 
machine 2 ) professional theoretical mathematicians, who will develop conceptual paradigms to make 
sense out of the empirical data, and who will reap Fields medals along with (human and machine) 
experimental mathematicians. Will there still be a place for mathematical mathematicians? 

This will happen after a transitory age of semi-rigorous mathematics, in which identities (and 
perhaps other kinds of theorems) will carry price-tags. 

A Taste Of Things To Come 

To get a glimpse of how mathematics will be practiced in the not too far future, I will describe 
the case of algorithmic proof theory for hypergeometric identities\WZ*] [Z*] [Ca] . In this theory, 
it is possible to rigorously prove, or refute, any conjectured identity belonging to a wide class of 
identities, that includes most of the identities between the classical special functions of mathematical 
physics. 
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For example, my computer Shalosh B. Ekhad, and its friend Sol Tre, already have a non-trivial publication list, e.g. 
[E], [ET] 



Any such identity is proved by exhibiting a proof certificate, that reduces the proof of the given 
identity to that of a finite identity among rational functions, and hence, by clearing denominators, 
to that between specific polynomials. 

This algorithm can be performed successfully on all "natural identities" we are now aware of. 
It is easy, however, to concoct artificial examples for which the running time, and memory, are 
prohibitive. Undoubtedly, in the future "natural" identities will be encountered whose complete 
proof will turn out to be not worth the money. We will see later, how, in such cases, one can get 
"almost certainty" with a tiny fraction of the price, along with the assurance that if we robbed a 
bank, we would be able to know for sure. 

This is vaguely reminiscent of transparent proofs introduced recently in theoretical computer sci- 
cncc[Ci][ALMSS][AS]. The result that there exist short theorems having arbitrarily long proofs, a 
consequence of Godel's incompleteness theorem, also comes to mind [S]. 3 I speculate that similar 
developments will occur elsewhere in mathematics, and will "trivialize" large parts of mathematics, 
by reducing mathematical truths to routine, albeit possibly very long, and exorbitantly expensive 
to check, "proof certificates". These proof certificates would also enable us, by plugging in random 
values, to assert "probable truth" very cheaply. 

Identities 

Many mathematical theorems are identities: statements of type "=", which take the form A = B. 
Here is a sampler, in roughly an increasing order of sophistication. 

1. 2 + 2 = 4. 

2. (a + bf = a 3 + 3a 2 b + 3ab 2 + b 3 . 

3. sin(x + y) = sin(x)cos(y) + cos(x)sin(y). 

4. F^F^ - Fl = • 

5. (a + 6)" = £Lo GjK& n - fc ■ 

6- £Z=-„(-i) fc tt) 8 = ffl- 

7. Let (q) r := (1 - g)(l - q 2 ) ... (1 - q r ), then 

^ q r2 ^ (_l)r g (5r*-r)/2 

(q) r {q) n - r (q) n - r (q) n+r 

7'. Let (q) r be as in 7. 



3 Namely, the ratio (proof length)/(theorem length) grows fast enough to be non- recursive. Adding an axiom can 
shorten proofs by recursive amounts [G], [D]. 
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OO r 2 OO 

E^r = IK 1 -^ +1 )" 1 ( 1 -^ +4 )" 1 

r=0 i=0 



8. Let H n be given by, 

n ~ nW "(i-g)(i-9 2 )""-(i-^) ' 

then 

/ " 2(- g "+i)fc V " 4(- g ) fc H n+k H n _ k " fc „ 4 

lrO^ + ^~ 7 * (l+^) 2 ^ *n =( fc ^ (_9) } 

\fc=0 / k= — n k=—n 

8'. 



fc= — oo 



9. Analytic Index= Topological Index. 

10. Re(s) = \ for every non-real s such that £(s) = 0. 

All the above identities are trivial, except possibly the last two, which I think quite likely will be 
considered trivial in two hundred years. I will now explain. 

Why are the first 8 identities trivial? 

The first identity, while trivial nowadays, was very deep when it was first discovered, independently, 
by several anonymous cave-dwellers. It is a general, abstract theorem, that contains, as special 
cases, many apparently unrelated theorems: Two bears and Two bears make Four bears, Two 
apples and Two apples make Four apples, etc. It was also realized that in order to prove it 
rigorously, it suffices to prove it for any one special case, say, marks on the cave's wall. 

The second identity: (a+6) 3 = a 3 +3a 2 6+3a& 2 +6 3 , is of one level of generality higher. Taken literally 
(in the semantic sense of the word literally), it is a fact about numbers. For any specialization of 
a and b we get yet another correct numerical fact, and as such it requires a "proof", invoking the 
commutative, distributive and associative "laws". However, it is completely routine when viewed 
literally, in the syntactic sense, i.e. in which a and b are no longer symbols denoting numbers, but 
rather represent themselves, qua (commuting) literals. This shift in emphasis roughly corresponds 
to the transition from Fortran to Maple, i.e. from numeric computation to symbolic computation. 
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Identities 3 and 4 can be easily embedded in classes of routinely verifiable identities in several ways. 
One way is by defining cos(x) and sin(x) by (e lx + e~ lx )/2 and (e lx — e~ lx )/(2i), and the Fibonacci 
numbers F n by Binet's formula. 

Identities 5-8 were, until recently, considered genuine non-trivial identities, requiring a human 
demonstration. One particularly nice human proof of 6, was given by Cartier and Foata[CF]. A 
one-line computer-generated proof of identity 6 is given in [E]. Identities 7 and 8 are examples 
of so-called q-binomial coefficient identities, (a.k.a. terminating q-hypergeometric series.) All such 
identities are now routinely provable[WZ2](see below.) The machine-generated proofs of 7 and 
8 appear in [ET] and [AEZ] respectively. Identities 7 and 8 immediately imply, by taking the 
limit n — ► oo, identities 7' and 8', which, in turn, are equivalent to two famous number-theoretic 
statements: The first Rogers- Ramanuj an identity, that asserts that the number of partitions of an 
integer into parts that leave remainder 1 or 4 when divided by 5 equals the number of partitions of 
that integer into parts that differ from each other by at least 2, and Jacobi's theorem that asserts 
that the number of representations of an integer as a sum of 4 squares, equals 8 times the sum of 
its divisors that are not multiples of 4. 

The WZ Proof Theory 

Identities 5-8 involve sums of the form 

n 

^2 F(n, k) , (Sum) 

k=0 

where the summand, F(n,k), is a hyper geometric term, (in 5, and 6), or a q-hypergeometric term, 
(in 7, and 8), in both n and k, which means that both quotients F(n + 1, k)/F(n, k) and F(n, k + 
1)/F(n,k) are rational functions of (n,k) ((q n ,q h ,q) respectively). 

For such sums, and multi-sums, we have ([WZ2]) the following result. 

The Fundamental Theorem of Algorithmic Hypergeometric Proof Theory: 

Let F(n\ k\, . . . , k r ) be&proper (see [WZ2]) hypergeometric term in all of (n; k\, . . . , k r ). Then there 
exist polynomials Po(n), . . . ,Pi(n) and rational functions Rj{n; k\, . . . , k r ) such that Gj := RjF 
satisfy 

L 

^2pi(n)F(n + i;ki,...,k r ) = 

i=0 

r 

~^2[Gj(n; k\, . . . , kj + 1, . . . , k r ) — Gj(n; k\, . . . , kj, . . . , k r )] (multiWZ) 
and hence, if for every specific n, F(n; —) has compact support in (ki, . . . , k r ), the definite sum, 



4 



g(n), given by 



9{n) ■= ^2 F(n;fei,...,fe r ) 



(multiSum) 



ki , . . . ,fe r 



satisfies the linear recurrence equation with polynomial coefficients: 



L 



^2pi(n)g(n + i) = 



(P — recursive) 



i=0 



(P — recursive) follows from (multiW Z) by summing over {fci, . . . , &v}, and observing that all the 
sums on the right telescope to zero. 

If the recurrence happens to be first order, i.e. L = 1 above, then it can be written in closed form: 
for example the solution of the recurrence (n + l)g(n) — g(n + 1) = 0, g(0) = 1, is g(n) = nl. 

This "existence" theorem also implies an algorithm for finding the recurrence (i.e. the pi) and the 
accompanying certificates Rj, see below. 

An analogous theorem holds for q-hypergeometric series[WZ2][K]. 

Since we know how to find, and prove, the recurrence satisfied by any given hyper geometric sum 
or multi-sum, we have an effective way of proving any equality of two such sums, or the equality 
of a sum with a conjectured sequence. All we have to do is check whether both sides are solutions 
of the same recurrence, and match the appropriate number of initial values. Furthermore, we can 
also use the algorithm to find new identities. If a given sum yields a first-order recurrence, it can 
be solved, as mentioned above, and the sum in question turns out to be explicitly evaluable. If 
the recurrence obtained is of higher order, then most likely the sum is not explicitly-evaluable (in 
closed form), and Petkovsek's algorithm[P], that decides whether a given linear recurrence (with 
polynomial coefficients) has closed form solutions, can be used to find out for sure. 

Almost Certainty For An e Of The Cost 

Consider identity (multiSum) once again, where g(n) is "nice". Dividing through by gin) and 
letting F — > F/g (Herb Wilf's wonderful trick), we can assume that we have to prove an identity 
of the form 



The WZ theory promises you that the left side satisfies some linear recurrence, and if the identity is 
indeed true, then the sequence g(n) = 1 should be a solution (in other words po (n) + . . -+PL{n) = 1). 




(Nice) 



k i j . . . . kf 
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For the sake of simplicity, let's assume that the recurrence is minimal, i.e. is g(n + 1) — g(n) = 0, 
(this is true anyway in the vast majority of the cases.) To prove the identity, by this method, we 
have to find rational functions Rj(n; ki, . . . , k r ) such that the Gj := RjF satisfy: 

F(n + 1; ki, . . . , k r ) — F(n; ki,...,k r ) = 

r 

^^[Gj(n; k\, . . . ,kj + 1, . . . , k r ) — Gj (n; ki, . . . , kj, . . . , k r )} . {multiW Z') 

By dividing (multiW Z') through by F, and clearing denominators, we get a certain functional 
equation for the Ri, . . . , R r , from which it is possible to determine their denominators Qi, . . . ,Q r . 
Writing Rj = Pj/Qj, the proof boils down to finding polynomials Pj(ki, . . . ,k r ), with coefficients 
that are rational functions in n and possibly other (auxiliary) parameters. It is easy to predict 
upper bounds for the degrees of the Pj in (k 1: . . . , k r ). We then express each Pj symbolically, with 
"undetermined" coefficients, and substitute into the above-mentioned functional equation. We 
then expand, and equate coefficients of all monomials k^ 1 . . . k^ r , and get an (often huge) system 
of inhomogeneous linear equations with symbolic coefficients. The proof boils down to proving that 
this inhomogeneous system of linear equations has a solution. It is very time-consuming to solve a 
system of linear equations with symbolic coefficients. By plugging in specific values for n and the 
other parameters, if present, one gets a system with numerical coefficients, which is much faster 
to handle. Since it is unlikely that a random system of inhomogeneous linear equations with more 
equations than unknowns can be solved, the solvability of the system for a number of special values 
of n and the other parameters is a very good indication that the identity is indeed true. It is a 
waste of money to get absolute certainty, unless the conjectured identity in question is known to 
imply the Riemann Hypothesis. 

Semi-Rigorous Mathematics 

As wider classes of identities, and perhaps even other kinds of classes of theorems, become routinely- 
provable, we might witness many results for which we would know how to find a proof (or refutation), 
but we would be unable, or unwilling, to pay for finding such proofs, since "almost certainty" can 
be bought so much cheaper. I can envision an abstract of a paper, c. 2100, that reads : "We 
show, in a certain precise sense, that the Goldbach conjecture is true with probability larger than 
0.99999, and that its complete truth could be determined with a budget of $105." 

It would be then OK to rely on such a priced theorem, provided that the price is stated explicitly. 
Whenever statement A, whose price is p, and statement B, whose price is q, are used to deduce 
statement C, the latter becomes a priced theorem priced at p + q. 

If a whole chain of boring identities would turn out to imply an interesting one, we might be 
tempted to redeem all these intermediate identities, but we would not be able to buy out the whole 
store, and most identities would have to stay unclaimed. 
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As absolute truth becomes more and more expensive, we would sooner or later come to grips with 
the fact that few non-trivial results could be known with old-fashioned certainty. Most likely we will 
wind up abandoning the task of keeping track of price altogether, and complete the metamorphosis 
to non-rigorous mathematics. 

Note: Maple programs for proving hyper geometric identities are available by anonymous ftp to 
math, temple . edu, in directory pub/zeilberger/programs. a Mathematica implementation of the 
single-summation program, can be obtained from Peter Paule at paule@risc.uni-linz.ac. at . 
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